NeuTTS Air is the world's first ultra-realistic, device-side text-to-speech (TTS) language model with instant voice cloning capabilities. Built on a 0.5B large language model backbone network, it can bring natural voice, real-time performance, built-in security features, and speaker cloning capabilities to local devices.
Audio Processing
Safetensors